The Age-Specific Force of Natural Selection 
and Walls of Death 

Kenneth W. Wachter, Steven N. Evans, and 

David R. Steinsaltz 

30 June 2008 

Abstract 

CO ■ W. D. Hamilton's celebrated formula for the age-specific force of 

natural selection furnishes predictions for senescent mortality due to 

PQ ' mutation accumulation, at the price of reliance on a linear approx- 

Ph ■ imation. Applying to Hamilton's setting th e full non-linear d emo- 

O ! graphic model for mutation accumulation of Evans et al.l ( 20071 ) 
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O ' find surprising differences. Non-linear interactions cause the collapse 



of Hamilton-style predictions in the most commonly studied case, re- 
fine predictions in other cases, and allow Walls of Death at ages before 
the end of reproduction. Haldane's Principle for genetic load has an 
£> ■ exact but unfamiliar generalization. 
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c3 1 The Force of Selection 



The best-known formula at the intersection of genetics and demogra- 
phy is doubtless W. D. Hamilton's "age-specific force of natural selection". 



Hamiltonl (119661 ) differentiated a measure of fitness, Lotka's intrinsic rate of 
natural increase, with respect to an increment to age-specific mortality at 
an age a. Thus he obtained a linear approximation for loss in fitness due to 
any deleterious mutations that raised mortality at any one specific age. The 
greater the loss in fitness, the faster should mutant alleles be selected out 
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of a population and the fewer should be found at equilibrium as recurring 
mutations balance natural selection. 

By this route, Sir Peter Medawar's concept of mutation accumulation 
as an evolut i onary reason for se nesc ence takes on mathe matical form. As 
Medawarl ( 19521 ). iFinchl ( 19901 ). or ICharlesworthl (2000), the idea involves 



in 



genetic load produced by large numbers of mildly deleterious mutations oc- 
curring at widely separated loci, each with some sma l l age- specific effect on 
vital schedules. As developed by Brian ICharlesworthl (11994] ) this framework 
guides the interpretation of many experiments i n aging research and popula - 
tion genetics. A recent perspective is offered by lFlatt and Promislowl (120071 ). 
For definitions, see Section [2J 



Hamilton' s work, r e printe d in iHamiltonl (119951 ). has been assessed and 
extended by iBaudischl (120081 ). Sophisticated genetic mode ls of m u tation - 
selection balance are covered in an authoritative book by iBiirgerl (120001 ) . 
Demographers mainly put up with less sophisticated models of the genome 
in return for more refined treatments of age-specific structure, as we do 
here. Age-specific predictions for vital schedules are some times robust to 
details of genetic specification, in line with a principle of iHaldand (119371 ) 
which equates the population loss in fitness from genetic load to the to- 
tal mutation rate, independent of the form of action of mutations. Cur- 
rent inte rest has been stimu l ated by the expansion of b io demograph y, re- 
viewed inlWachter and Finchl (ll997h . lVaupel et ail (Il998h . [Carevl (J2003h . and 
Carey and Tuljapurkarl (120031 ). 



The theory built on Hamilton's formula is, in short, a centerpiece for 
demographic research. But Hamilton's formula fails to be self-consistent. 
Its reliance on linear approximation requires total increments to age-specific 
mortality to stay small where the formula predicts them to grow large. 

Non-linearity is built into the fitness measure. As deleterious mutations 
arise, their overall effect differs from the sum of their individual effects. Di- 
minished survival at any one reproductive age necessarily leaves less repro- 
duction to be lost by a drop in survival at any other reproductive age. This 
interaction, a key feature of mutation accumulation, is set to zero in the lin- 
ear framework. If all effects were small enough, little accuracy would be lost, 
but the prediction turns out to be for large cumulative effects from small 
individual ones. Richly suggestive as it is, Hamilton's formula can only be 
taken so far and no further. How far is too far? The answer has remained in 
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doubt. 



In Evans et al.l (120071 ) we developed a full non-linear age-specific model 
for mutation accumulation. In this paper we apply our model to Hamilton's 
setting, in which the effect on age-specific mortality of each deleterious mu- 
tation is concentrated at a single age. As explained shortly, these effects are 
called "point-mass increments" and they are added onto the continuous-age 
version of mortality rates called the "hazard function". This specification 
is not a realistic one for actual mutations, but it is of central interest given 
its dominant role in prior work. A companion paper in preparation applies 
the new model to cases with more plausible specifications of distributed mu- 
tational effects. With point-mass increments, we are fortunate to obtain 
closed-form expressions, a particular advantage since numerical simulations 
may not readily reveal whether finite solutions do or not exist. 

The most studie d case with point-mass increments was put forward by 



Charlesworthl (120011 ) . He showed that the widely-observed Gompertz-Makeham 
form for age-specific mortality would be predicted exactly by the linear ap- 
proximate model if one took mutation rates, background hazards, and fer- 
tility all constant across ages beyond an age of maturity. The Gompertz- 
Makeham form has a hazard function rising exponentially with age added 
onto a constant background level. Charlesworth was able to generalize the 
Gompertz-Makeham prediction beyond the point-mass setting. In his frame- 
work, a "Wall of Death" with infinite hazards and zero survivorship could 
occur, at but not before an upper age limit to fertility, if such an age limit 
were imposed. Charlesworth's elegant results with the linear approximate 
model have shaped thinking in biodemography. 

Our paper presents three main findings for the full non-linear model with 
point-mass increments, all of them surprising: 

1. In the elementary case with constant rates, the linear model breaks 
down when non- linear interactions are taken into account. An equilib- 
rium ceases to exist, and accumulating mutations drive survival to zero 
at every adult age. 

2. When provisions are introduced that preserve survival at some adult 
ages, a "Wall of Death" can occur before rather than at the oldest age 
of reproduction. 
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3. A generalization of Haldane's Principle holds exactly but it takes an 
unfamiliar form. 

These results demonstrate the limitations of the traditional approach 
grounded in Hamilton's formula. They furnish guidance for constructing 
more realistic specifications of mutational action. It is especially revealing 
to learn that Gompertz-Makeham hazard functions cannot withstand cer- 
tain simple kinds of sustained mutational pressure when interactions are 
taken into account. In the interpretation which we shall offer, equilibrium 
solutions disappear because they collapse to something outside the model 
specification. The "missing" equilibrium is a stylized life history in which 
all fertility is concentrated in a burst a t a sin gle age, followed by immediate 



death, a life history which iTuljapurkarl (119971 ) calls "the salmon limit" 



Like the linear models, our non-linear model is an infinite-population 
model in continuous time. The representation of the genetic structure is kept 
somewhat stylized in order to allow ramific ation of the demographi c struc - 



ture. The model follows in the tradition of iKimura and Maruyamal (119661 ). 
Inheritance is diploid, with random mating, weak selection, and fitness cal- 
culated for individuals rather than for mating pairs. At each of a large or 
infinite number of sites is found either a wild-type or a dominant deleterious 
mutant allele. Alleles with the same effect on the hazard function are treated 
as if they were copies of the same allele, though found at different sites. Back 
mutation is taken to be negligible. A randomly selected member of the pop- 
ulation carries some collection of such mutant alleles, and the state of the 
population is described by a joint probability distribution for the counts of 
alleles of different kinds, that is, of alleles acting at different ages. The lin- 
ear models posit Poisson di stributions, consistent with a derivation found on 



page 137 of iDurrettl (120021 ). Our model recovers this Poisson property for 



solutions, in a sense explained in Section [31 The prop erty follows f r om a n 



assumption called "Free Recombination," spelled out in lEvans et al.l (120071 ) 



Genetic recombination makes no difference when interactions are all sup- 
pressed by linear approximations, but it matters in non-linear models. Un- 
der Free Recombination, recombination is assumed to act on a more rapid 
timescale than mutation and selection, erasing linkage disequilibrium among 
sites involved in the mutation accumulation proces s. An alternative non- 



linear model without recombination is presented in ISteinsaltz et al.l (120051 ) 
but will not be treated in this paper. 
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When age like time is taken to be continuous, adding an increment to 
the hazard function "at" an age a does not strictly make sense. The remedy 
is to think of adding a point mass or "delta function" of some size r\ to the 
hazard function, equivalent to adding onto the cumulative hazard function 
a step function with a single upward step of size 77 at age a. Hamilton's 
formula then follows by differentiating Lotka's r with respect to 77 applying 
implicit differentiation to the Euler-Lotka Equation which defines r. In the 
application of our model to Hamilton's setting, the mutational effects are 
point-mass increments. 

Hamilton's formula applies in principle to favorable as well as to deleteri- 
ous mutations. However, outside the context of mutation accumulation the 
formula is uninformative with respect to age-specific shapes of vital schedules. 
Any recurring favorable mutation eventually goes to fixation. The rapidity of 
fixation does not matter to the ultimate contribution. Transient effects are 
possible, but those would depend on independent definition of some origin for 
time. Parallel studies of fertility are also of interest but outside our present 
scope. Here we concentrate on deleterious mutations affecting survival and 
their demographic consequences. 



2 The Linear Approximation 



Our non-linear model is derived from dynamic equations in lEvans et al.l 



( 120071 ) and may be regarded as a limiting form of standar d dis cret e-p opulat ion 



geneti c models such as those of lBarton and Turellil (119911 ) and lKirkpatrick et al 



( 120021 ) in the asymptotic regime of weak selection and mutation. Although 
the derivation is complicated, the formulas for predicted hazards are simple. 
They are presented in Section [3J An informal account of why the answers 
take the shape they do is offered in this section through an examination of 
the linear approximation built into Hamilton's formula. 

For the convenience of readers, we review some demographic terminol- 
ogy. Let C be a random variable that represents the life-span of an individual 
picked at random from a population, or from a subpopulation sharing some 
attribute. The survivorship function l x := P{£ > 2} is the probability of 
survival from birth to age x. The cumulative hazard at age x is — log^. The 
hazard function itself at age x is minus the derivative of the logarithm of the 
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survivorship, when the derivative exists. The product of the survivorship 
function l x with the age-specific fertility rate f x is the Net Maternity Func- 
tion f x l x . Fertility for males is taken to be governed by the rates for their 
female mates. The models do not explicitly segregate individuals by sex. 
The area under the Net Maternity Function (that is, J„ f x l x dx) is the Net 
Reproduction Ratio or NRR. The NRR measures generational replacement, 
the ratio of the size of the next generation to the current one. 

If the survivorship function l x and fertility schedule f x do not change 
over time, then they lead to a population with an unchanging proportional 
distribution of ages called a stable population. The growth rate of the stable 
population (the slope of the logarithm of population size over time) is Lotka 's 
intrinsic rate of natural increase r. Lotka's r is the unique real root of the 
Euler-Lotka Equation 1 = J e~ rx f x l x dx. That is, r is the unique real zero 
of the logarithm of the Laplace Transform of the Net Maternity Function. 
On evolutionary timescales, the parameter r is assumed to have been close 
to zero, the growth rate of a stationary population. 

Hamilton's expression for the force of natural selection is a function of the 
age a at which the effect of a mutant allele is assumed to be concentrated. 
It is calculated from background or extrinsic schedules for l x and f x , called 
the baseline schedules. If time is measured in units of generations, with 
generation length equal to the stable population mean age at childbearing, 
and if Lotka's parameter r is set to the stationary level r = 0, the force of 
selection reduces to the simple form w(a) = J a °° f x l x dx, the expected number 
of offspring produced after age a by an individual chosen at random from the 
population, offspring that would be lost by death at age a. With time in 
units of generations, the expression is the same whether the fitness measure 
being differentiated is Lotka's r or the Net Reproduction Ratio. 

As we have said, Hamilton's expression gives a linear approximation to 
the fitness cost of mutations which add point-mass increments of some size rj 
to the hazard at age a. The loss from n such mutations is being approximated 
by nr] times w(a), even though the fitness itself depends exponentially on n 
and r\. Deleterious mutations are taken to be occurring at a rate v{a) and 
accumulating. For outflow nnw(a) due to selection to balance inflow v(a) 
due to mutation at equilibrium, the linear model sets n ~ u(a)/(r)w(a)). 
The approximation for the increment to the hazard function is then nrj 



{nv)/{rjw) = v/w, independent of the size n of the effect. See ICharlesworth 
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(]200ll ) for a full account. 



A contradiction arises because mutational effects of tiny size imply equi- 
librium increments to the hazard function of hefty size. Small values of 77, 
which ought to help the approximation, strain it by bolstering the equilib- 
rium value for n. The force on any one of n alleles acting at a responds to 
alterations to the survivorship function due to the other n — 1 alleles, as well 
as to alterations due to alleles acting at other ages. In other words, interac- 
tion terms enter the calculation. For consistency, w ought to be computed 
from a new l x , reflecting these alterations. But then we would have new 
increments to the hazard function and need a new new L, over and over. 



'•.!'• 



The hazard function at equilbrium computed from the survivorship func- 
tion for the population as a whole may be written as a sum of a contri- 
bution X(x) due to baseline risks and increments h(x) due to the mutant 
alleles distributed across the population. The linear approximation based on 
Hamilton's formula puts 

h ( a) « i^i = V M (i) 

w(a) f a °° exp (- /* \{y)dy) f x dx 

Since all increments affect selective costs, a guess at correction might be 
to replace the baseline hazard with the total hazard, leading to an equation 
with h on both sides: 

. . v (a) . . 

ir ex p (- Jo ( x (y) + %)) d y) /* dx 

This guess turns out to be the actual equation for h with point-mass incre- 
ments in the full non-linear model. 

The simplicity of Equation (j2j) hides some subtleties. The function h in 
(J2J) is the hazard function computed from the population survivorship func- 
tion. Also called the "aggregate" hazard function, it differs from the average 
hazard function. Across the population, the set of mutant alleles present in 
each individual is random. Different members carry different genetic loads. 
Without such heterogeneity, natural selection would have nothing to select. 
Members carrying heavier loads tend to die at younger ages. This process 
of "demographic selection" makes hazards observed among survivors lower 
than hazards evaluated by averaging over the distribution of alleles inherited 
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at birth. It is not obvious whether formulas should feature the average haz- 
ard or the aggregate hazard. As shown in Section [31 the non-linear model 
picks out the aggregate hazard, as a consequence of computing a statistical 
expectation value for the marginal cost of each mutant allele. 

Another point concerns the specification of fitness costs in the presence 
of heterogeneity. When linear approximations are being used, it makes no 
difference whether costs are based on L otka's r or on t he Ne t Reproduction 



Ratio, the NRR. Pages 136 to 146 of ICharlesworthl (11994J ) give a careful 



examination of first-order and second-order terms. For non-linear models, the 



choice does make some difference. As ICharlesworthl (120001 ) points out on page 



930, the NRR is the appropriate fitness measure for our purposes. The alleles 
are not invading a population but are being held at equilibrium frequencies. 
Measuring selective cost by reductions in the NRR makes frequencies agree 
with classical formulas for single-locus models. 

The demographic background to the specification of selective costs may be 
clarified by reference to stable population theory. The population members 
who carry a particular collection of mutant alleles make a contribution to the 
next generation given by their mutation-dependent NRR. Thanks to new 
mutations as well as to recombination, their offspring do not carry identical 
collections of alleles. Groups of carriers are broken up each generation, before 
they establish their own special stable age structure or their own special 
values of Lotka's r, the growth rate that occurs with a stable age structure. 
At equilibrium all groups of carriers share the same growth rate, since their 
numbers are replenished by new mutations to balance their loss in numbers 
due to natural selection. For this reason, the NRR rather than r determines 
selective costs for mutation accumulation. 

We are now in a position to preview our chief results. In the linear theory, 
the force of selection w(a) is non-zero for any age at which the net maternity 
function calculated from the baseline schedules is non-zero. Non-zero w(a) 
implies finite equilibrium numbers of mutant alleles acting at each age a. 
Numbers may tend to infinity as a approaches the last age of reproduction, 
imposing a "Wall of Death" at that age but not before. Late-acting mutations 
have no impact on the numbers of earlier- acting mutations. Selective pressure 
at an age abutting on the Wall of Death is calculated as if there were no Wall 
of Death. 

In the full non-linear theory, in contrast, a Wall of Death, by erasing all 
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net reproduction beyond itself, reduces selective pressure against mutations 
acting at slightly younger ages. Reduced pressure may mean that selection 
cannot balance the rate of new mutations and equilibrium conditions cannot 
be satisfied at a younger age. The Wall of Death at the older age comes 
to imply a Wall of Death already at a slightly younger age. If this chain of 
implication proceeds unchecked through younger and younger ages, it may 
mean that no equilibrium solution with finite mean numbers of mutations 
exists at all. We say that "the solution unravels", or, more precisely, that 
an attempt to solve for the equilibrium distribution of mutant alleles in the 
population unravels. We show in Section [5] that such pathology can in fact 
occur. 

Unraveling can be prevented in some biologically sensible ways. But 
equilibrium mean numbers of mutations can still go to infinity at ages before 
the last age of net reproduction, as we show in Section [61 A Wall of Death 
may be found at earlier ages than the linear framework permits. 

Mutation accumulation has the appealing property that predictions are 
insensitive to some of the de tails of specific ation. Working with genetic 



models without age structure, iHaldand (119371 ). page 341, announced that 



"... the loss of fitness to the species depends entirely on the 
mutation rate and not at all on the effect of the gene upon fitness 
of the individual carrying it ..." 

He found the sum total of mutation rates at different sites to be approxi- 
mately equal to the resulting decrement in the logarithm of fitness, a measure 
of "genetic load" essentially equivalent to our selective cost. 

Haldane worked with two alleles per site and imposed a strong assump- 
tion about independence of mutational effects. Generaliz ations to multi-allele 



models are proved on pages 105 to 112 and 143 to 153 of lBurgerl (120001 ). and 
a version holds in the age-specific linear framework. In our non-linear model 
with point-mass increments, the aggregate population hazard and the popu- 
lation loss in fitness are strictly independent of the sizes of the increments, as 
we show in Section [71 The total mutation rate, however, is not equal to the 
total selective cost, but to a less conventional function of the demographic 
schedules. 
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3 The Non-Linear Model 



We now present the non-linear model from lEvans et al.l (120071 ) and show 
how it leads to the equation ([2]) for predicted aggregate hazards in the special 
case of point-mass increments. 

Mutant alleles m, distinguished by their age-specific effects, are drawn 
from a space Ai. In our point-mass setting, m corresponds to an age of 
action or "age of onset" , a point on the real line, and Ai is the positive real 
line itself. Each individual carries some finite batch of mutant alleles denoted 
by the letter g, and we use the word "genotype" as shorthand to refer to it. 
A member who carries no mutant alleles is said to carry the "null genotype" 
g — 0, with wild-type alleles at every site. 

An individual sampled at random from the population carries a random 
batch of points G. The count of points of G in any interval of M. is a 
random variable. The mean of this random variable is given by the area 
within the interval under a curve p called the intensity. (Technically speak- 
ing, p is a density with respect to Lebesgue measure on the l ine.) In the 



non-li near model under the assumption of Free Recombination, lEvans et al 



( 120071 ) prove that the random counts in disjoint intervals are independent 
random variables with Poisson distributions. Such a Poisson point process is 
uniquely determined by its intensity. It can also be defined for more general 
choices for M. including sample spaces for stochastic processes. The points of 
the Poisson process are points of age, not points of time. The Poisson prop- 
erty holds at any given time and also at equilibrium, if an equilib rium exists. 



Backg round on Poisson processes may be found in textbooks like iKallenberg 



(120021 ). Chapter 12. 



Each application of the general model requires three ingredients: the 
age-specific profiles for the actions of mutant alleles, the rate at which new 
mutations enter the population, and the selective cost which gradually drives 
mutant alleles out of the population. 

Here each mutation profile is written as a function K(m, x) of the index 
m and an age variable x. The function K(m, x) is multiplied by a size fac- 
tor n(m) and added onto the cumulative hazard function. In other words, 
the cumulative hazard function defined for a subpopulation of individuals 
with genotype g is formed by starting with the cumulative baseline hazard 
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and adding a term n(m)K(m,x) for each point m in the batch of points 
g. In demographic language, alleles act like independent competing risks in 
a multiple decrement lifetable. Oth er interes t ing fo rms of action including 



proportional hazards are studied by iBaudischl (J2008I ). Chapter 2 



In this paper, following Hamilton, we take the profile k(tti, x) to be a 
step function with a unit step at the age of onset for m, corresponding to a 
point-mass increment to the hazard itself. Equations do not depend on this 
special choice for k until Equation (jSJ) at the start of Section HI 

The number of new mutations per generation with m from some interval 
of M. is given by the area within the interval under a curve v(a) called the mu- 
tation rate. Like the intensity p, the mutation rate is a densit y with respect 



to Leb esgue measure. As in the elementary cases studied by ICharlesworth 



( 12001I ). we usually take vim) to be constant or nearly constant across ages 
within the reproductive span because we are interested in structure that 
arises from the logic of natural selection rather than from structure arbi- 
trarily built into assumed mutation rates. Concentrating on adult mortality, 
we make v vanish for ages of onset below some age at maturity a at which 
fertility and exogenous baseline mortality commence. The level of fertility 
is tuned to produce a stationary population at equilibrium, cancelling out 
effects of juvenile mortality and letting us omit them here. 

The selective cost function S is a non-negative function of g, here taken 
equal to the decrement in the NRR due to the mutant alleles included in g. 
This choice was explained in Section [2j Formulas are given in Equations (jSJ) 
and (JBJ) below. 

Since a Poiss on point process i s uniquely determined by its intensity, 



the theorems in lEvans et al.l (120071 ) allow the population over time to be 
described by an equation for the intensity pt{m) over time. This dynamic 
equation involves an expectation value, written E p , which averages over the 
random genotypes G of randomly selected members of the population, using 
the intensity function for the Poisson process: 

^^ = u(m) - Pt(m)E pt [S(G + 8 m ) - S(G)] (3) 

The symbol G + 5 m denotes a genotype with all the mutant alleles in G plus 
one copy of m, using 5 m to stand for the measure which puts mass 1 on 
the point m. Equation ([3]) sets change equal to inflow minus outflow, with 
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inflow given by the mutation rate and outflow given by mean numbers times 
the marginal selective cost of each additional mutant allele. An equilibrium 
intensity p, if one exists, has the left-hand side equal to zero, requiring p to 

satisfy 

v{m) = p{m) E p [S(G + 5 m ) - S(G)} (4) 

We calculate predicted hazard functions at equilibrium by substituting 
demographically meaningful expressions for S(g + 5 m ) and S(g) for each 
fixed g. The selective cost function S measures the fitness of genotypes 
on an implicitly logarithmic scale. We continue to write f x for the fixed 
baseline age-specific fertility schedule and X(x) for the baseline hazard rate, 
so that the survivorship function for the null genotype is given by l x (0) = 
exp(— J X \{a)da). 

The function S can be defined to equal 

S(g) ■= / f x l x (0)dx- / f x l x (g)dx (5) 

The integrals are all taken over ages for which the integrands are non-zero. 
The function l x (g), the probability of survival to age x for members with 
genotype g, is derived from hazards that include increments from mutations 
in g. We add up the increments on top of the baseline to form the cumulative 
hazard, insert a minus sign, and exponentiate to recover the survivorship: 



l x {g) ■= lx(0) exp I - ^2 v( m ' 

\ m'£g 



) K,(m',x) ) (6) 

eg 

The prime on m is a reminder not to confuse the index of summation with 
the mutant allele m whose marginal cost we seek to calculate. 

Adding a copy of some particular mutant allele m to the alleles already 
in g multiplies this survivorship by exp(— r](m)K(m, x)), with a marginal cost 
equal to 

POD 

S(g + 5 m )-S(g)= (1 - e-^ m ^ m ^)f x l x (g) dx (7) 



The theory tells us that our non-linear counterpart to Hamilton's age- 
specific force of natural selection is the expectation value of this marginal 
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cost, formed by letting g range over randomly selected genotypes G from 
the Poisson point process. The integrand factors into a fixed part involving 
only the extra allele m and a random part involving only G, namely the 
net maternity f x l x {G) for the subpopulation with genotype g = G. By 
Equation (jSJ), the net maternity function resembles a Laplace Transform for 
the distribution of G. This Poisson Proce ss expectat i on ca n be taken in 



closed form, as, for instance, on page 227 of iKallenbera (J2002J): 



E p [l x (G)) = l x (0) exp ( - / (1 - e -*?K)«K,*)) p ( m ')d m > ) (8) 



M 



Solutions with Point Mass Profiles 



In Section [3J the expressions do not depend on the form of k. Now we 
specialize to point-mass profiles. The step-function form for k leads to further 
simplification. We substitute from Equation ([7]) in the equilibrium formula, 
Equation (jlj), replacing m with its age of onset a. 

poo 

u(a)=p(a) / (l-e-^« a ^)E p [f x l x (G)]dx (9) 



The step function preceding the expectation value restricts the range of 
integration to x > a, replacing a by a, since the integrand vanishes for x 
below the age of onset. The random factor also simplifies, since the integral 
over M. in Equation ([8]) can be replaced by an integral over ages from a to 
x. It can be written in terms of the increment h to the aggregate hazard 
discussed at length in Section [2j The increment h is defined by 

h(x) := --^log(E p [l x (G)/l x (0)]) (10) 

We can express the aggregate population survivorship function in terms of 
h: 

E p [l x (G)]=e^(-JjX(y)+h(y)]dyY (11) 

We have decomposed the aggregate population hazard into independent com- 
peting risks A due to baseline and h due to genetic load. When we substitute 
f fTT]) in Qj and compare with (JHJ), we see how h depends on p and r\: 

h(y):=(l-e-^)p(y) (12) 
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Furthermore, we see that Equation ([9]) is equivalent to the expression intro- 
duced in Section [2], Equation (J2J). As promised, we have derived the equation 
for h from the general equations of the non-linear model. 

We now specialize our choices of baseline schedules for the sake of our 
main applications. We choose a value for a, the age of maturity below which 
exogenous baseline mortality and fertility are taken to vanish. Beyond a, 
the baseline hazard is taken to be a constant A and fertility is taken to be 
a constant / tuned to produce a stationary population at equilibrium. We 
begin by imposing no upper age limit on fertility and later consider cases 
with fertility set back to zero beyond an age (3 < oo. 

Our goal in the rest of this section is to rewrite the equilibrium equation 
for h in a form that is easier to solve in special cases. We introduce notation 
for the indefinite integral of the aggregate population survivorship function 

T{a):= exp(-/ [A + h{y)]dy J dx (13) 

This quantity T(a), often written T a , is the same as the column for "remain- 
ing person-years lived" in the population lifetable. Since for simplicity we 
have been omitting juvenile mortality, life expectancy £ at the age of matu- 
rity a is the same as T(a) and equal to the reciprocal of the level of constant 
fertility required for stationarity: 

£ = T (a) = T(0) - a = 1/f. (14) 

The equilibrium condition §§§ can now be written in terms of h(a) and 
T(a): 

POO / PX \ 

u(a) = h(a) / / exp I — / (A + h(y))dy J dx = h(a) f T(a) (15) 



Since —T'(a) is the survivorship function and minus the derivative of log 
survivorship is the aggregate hazard, X+h(a) is T"(a)/T'(a). The equilibrium 
condition (Q therefore implies that T(a) must satisfy a non-linear second- 
order differential equation: 

"M = (^- A ) /r(a) (16) 
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Solution of this differential equation is facilitated by exploiting the mono- 
tonicity of T(a) to change variables from age a to person-years r = T(a). We 
make use of the inverse function T _1 which is defined to satisfy T~ 1 (T(a)) = 
a and T^T" 1 ^)) = r. The symbol o denotes composition of functions: 
T^iT^a)) is the same as T" 1 oT(a). Aggregate survivorship is expressed as 
a function of r by composing the derivative of T(a) with the inverse of T(a): 

L(t) := -T'oT-\t) (17) 

With this definition, L(T(a)) = E p [Z a (G)]. The function L is easy to inter- 
pret. The last r person-years lived by members of a cohort are lived by the 
fraction L of the members. 

The derivative of L(t) with respect to r comes out to be the hazard 
expressed as a function of r: 

rjiff rji — \f \ 

^)= _ T ;oT- 1 (V " oT ' 1(r)+A (18) 

From the definition of £, we have L(£) = 1. The function L must also satisfy 
the boundary condition L(0) = with L(t) > for r > 0. At an age to 
which no one survives, there are no remaining person-years to live. 

We can rewrite v = hfT in the form h = u/(fT), substitute for h, and 
express both sides as functions of r: 

i'M = A+^M (19) 



5 Unraveling 



The simplest case of mutation accumulation with point-mass profiles takes 
the mutation rate v to be a constant z/ at all reproductive ages. 

In th e linear framework, this case is the starting-point for the notable 



results of lCharlesworthl (1200 ll ). discussed in Section [U With a constant back- 
ground hazard at adult ages, we have baseline survival, remaining person- 
years lived, and Hamilton's force of natural selection all going down expo- 
nentially with age. With a constant mutation rate in the numerator and the 
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force in the denominator of the linear approximate formula, Equation (0Q), we 
have the mean intensity of mutations and the increment to the population 
hazard both going up exponentially with age, achieving a total hazard equal 
to an exponential plus a constant, a Gompertz-Makeham form. 

What impact do the non-linear interactions suppressed in Hamilton's for- 
mula turn out to have on this important prediction? We seek solutions to 
Equation Q) with v(a) = u . 

Plugging in a constant for z/, the solution to the differential equation (TT9l) 
including a constant of integration A is given by 

L(t) = \t + (u /f) log(r) + A (20) 

We expect to determine the constant of integration from the initial condition 
L(£) — l a — Iq — 1, since T(a) = £. We also expect to set / = l/£ to tune 
the population growth rate to stationary levels. A puzzle arises, because £, 
the life expectancy at maturity, is as yet an unknown quantity that should 
be determined by the equations, while it appears that different values of £ 
can correspond to different choices for A. But a deeper problem intervenes. 
The quantity log(r) goes to minus infinity as the remaining person-years of 
life r go to zero, forcing there to be some non-zero value of r for which L(r) 
vanishes. But our model requires that L(r) = only when r = 0. We 
cannot have further person- years to be lived when no survivors remain. This 
contradiction shows that the non-linear model has no equilibrium solution in 
this elementary case. This unexpected finding is the first of our main results. 

We can visualize the disappearance of an equilibrium in several ways, 
with respect to age, with respect to time, or with respect to the shape of the 
mutation rate function. 

The picture with respect to age has been mentioned in Section [2J We 
give a sketch rather than a formal argument. We seek to construct a solution 
satisfying the equilibrium condition (1151) restricted to some late range of ages 
[z, oo) and then seek to extend the construction backward to earlier ages. At 
late ages, selective pressure is driven to low levels by the background hazard 
as well as by any late-acting alleles. Beyond some sufficiently late z, we 
expect the pressure to be too weak to be balancing a rate of new mutations 
that does not drop with age. We therefore expect infinite h and a Wall of 
Death. At the next earlier ages, the elimination of later reproduction by the 
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Wall of Death again leaves weak selective pressure. The interaction between 
older and younger ages, missing from the linear approximate model, puts 
T(a) near zero. Low T(a) requires high h(a) to make h(a)f'T(a) as large as 
vq but high h(a) implies even lower T(a), demanding, as it turns out, that 
the Wall of Death be earlier. Each Wall of Death implies an earlier Wall 
of Death, the instability propagates down through the whole reproductive 
span, and our construction unravels. 

A complementary picture with respect to time is implied by the dynamic 
equation ([3]) with the null genotype as starting state. A steady influx of 
mutations affects the whole reproductive span, and h t (a) begins to increase 
over time like z/ t at all ages. At older ages, where selective pressure is 
always low, this linear increase continues unabated, whereas at younger ages 
it is slowed for a while by outflow due to natural selection maintained by 
substantial values of fT t (a). The function ht (a) at a snapshot in time has 
an age profile which keeps low for a stretch of ages, climbs as T t (a) drops 
off with age, and settles out at z/ 1. As time goes by, the climbing phase 
accompanying the drop in T t (a) shifts down to younger and younger ages, 
until the hazard rate at every adult age comes to be marching toward infinity. 

Details of the dynamics depend on assumptions about fertility. We may 
hold fertility fixed over time, but we have to recognize that no fixed fertility 
level is sufficient for stationary population growth at equilibrium when there 
is no equilibrium. Unbounded accumulation of mutations across the whole 
reproductive span drives any population to extinction. We may instead let 
fertility levels adjust over time to maintain stationarity with current values 
of h t (a), on an assumption that feedback between resources and population 
growth operates on a faster timescale than mutation and selection. Under 
this scenario, the climbing phase in the age profile of ht(a) steepens with age 
and time as it shifts to younger ages, and the fertility level heads toward 
infinity. 

A third way of picturing the disappearance of an equilibrium makes use 
of results from the next section about mutation rate functions with different 
shapes. Section [6] displays a family of examples in which the mutation rates 
are nearly constant but not exactly constant. Each mutation rate function 
has a drop down to zero with some characteristic steepness which turns out 
to produce an equilibrium with a Wall of Death. The nearer the mutation 
rate function to constancy, the nearer is the Wall of Death to the age at 
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maturity, the shorter the reproductive lifetime, and the higher the fertility 
level required for stationarity. In order to reach the case of a wholly constant 
mutation rate function the brief high burst of fertility before death would 
have to turn into a delta function or point mass at the age of maturity, 
followed by immediate death. 



Such a stylized life history, called the "salmon limit" by iTuljapurkar 



( 119971 ). is a far cry from the smooth Gompertz-Makeham equilibrium from 
the linear approximate model. Non-linear interactions make the Gompertz- 
Makeham form collapse, leaving no smooth equilibrium for the elementary 
case of constant mutation rates with point-mass increments. 

This outcome depends on recombination. Recombination spreads the 
deleterious alleles throughout the population, leaving no lineages untouched 
by a surfeit of late-acting mutant alleles. In the absence of recombination, 



as shown in ISteinsaltz et al.l (120051 ) , a minority group of high-fitness lineages 
can keep the aggregate population hazard finite at younger ages in the face 
of constant mutation rates and a late-age Wall of Death. 

It is remarkable that the collapse of the equilibrium in our model with Free 
Recombination does not depend on the magnitude of the uniform mutation 
rate v. Even the tiniest such rate cannot be balanced by the force of natural 
selection. The elementary case with constant mutation rates and point-mass 
increments is the most studied case for the linear approximate model. The 
transforming effect of non-linear interactions for this case of all cases is a 
dramatic denouement. 



6 Walls of Death 



Unraveling can be avoide d in several biolo gically reasonable ways, already 



adumbrated, for instance, in IWachterl (119971 ). pages llff. There may not be 



mutant alleles whose effects are entirely concentrated at or above an age of 
onset. An "entry cost" of some small loss of fitness at young ages associated 
with all later-acting mutations will keep unraveling in check. Restricting 
ages of onset to some finite subset of discrete ages will also suffice. Here 
we examine another alternative, mutation rates for point-mass increments 
that remain only nearly constant with age of onset and drop to zero at what 
comes to be the end of life. 
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We construct a family of cases with nearly constant mutation rates by 
seeking forms for the remaining person-years function T(a) satisfying Equa- 
tion f|T9|) consistent with a predetermined relationship between T(a) and 
u(a). We then construct u{a) to validate this relationship and deduce h(a) 
from T(a). Our cases are indexed by a positive exponent 9 less than or equal 
to 1. Our intended relationship between T(a) and u(a) for a > a takes the 
form 

u(a)=u o (T(a)/T(a)) (21) 

When 9 is close to zero, the rate starts at v(a) = z/ and remains nearly 
constant until close to the end of life. 

With v as in Equation (12T1) . the differential equation (1T91) takes the form 

L'(r)=\ + (fO- 1 Mr/^ 1 (22) 

Its solution is 

L(t) = At + (/^-Vo^-V + A (23) 

Applying our boundary condition L(0) = implies A = 0. With /£ = 1 
for stationary growth and L(£) = 1 by the definition of £, life expectancy at 
maturity is given by 

«=rnw» (24) 

Higher mutation rates lower life expectancy, as they should, and so does 
slower tapering of the mutation rates with lower 9. With no mutations, 
u = 0, survival drops exponentially at the rate A, and £ = 1/A. 

We can express a as a function of r, because we now know the derivative 
of a with respect to r, the reciprocal of —L(r). 

r T{a) -dr_ ft <h_ 

a a ~ Ma) W) ~ 7r(.) Ar+(/^)-^-^ ^ ( } 

This expression can be integrated in closed form and the answer inverted 
to give t as a function of age. We change notation for age from a to x for 
subsequent clarity: 

r(x) = £ ((1 + ^)(exp(-A(l - 9)(x - a))) - ^^ (26) 
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When we substitute into L'(r), we find the contribution of genetic load to 
the hazard rate to be 

hM = i/ exp(A(l-fl)(s-a)) 

W " (l + v /(\9))-v /(\e)exp(\(l-6)(x-a)) l ' 

The hazard rate goes to infinity as x approaches 

For 9 < 1, u is the age of a Wall of Death. 

We now need to write down a mutation rate v as a function of age which 
satisfies the posited relationship with T and leads to (|2"7|) as an equilibrium 
solution. The posited relationship follows easily by raising t(x)/£ from (|26l) 
to the power 9. However, this relationship does not tell us how to define v(x) 
beyond the age uj at which r vanishes. 

It is tempting to define v{x) = beyond u>, but if baseline fertility remains 
positive beyond cu, the resulting equilibrium will not be the limit over time 
of the dynamical process starting from the null genotype. Late-age fertility 
will keep ht(a) bounded and prevent the Wall of Death. A second alternative 
is to terminate the baseline reproductive span at an age (3 equal to uj. As 
with the linear approximate model, a limiting Wall of Death does then occur 
at the end of reproduction. 

A third alternative is to set v(x) back equal to some positive constant 
after its drop to zero at uj. The rate is pinched to zero at uj rather than 
cut off. The dynamical process starting from the null genotype does then 
converge to the equilibrium given by (l2"Tj) . This alternative is of theoretical 
interest, showing as it does that a Wall of Death can occur before rather than 
at the last age of reproduction in the full non-linear model. 

The behavior of our family of cases as the exponent 9 approaches zero 
has been discussed in Section [5j The mutation rate becomes more and more 
nearly constant and the Wall of Death at uj moves down to the age of matu- 
rity, wiping out all fertility. 

For ages x well below the Wall of Death, the denominator of Equation 
fl27|) is nearly constant, and the hazard along with the constant baseline con- 
tribution approximates a Gompertz-Makeham form. As x comes closer to the 
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Wall of Death, the hazard function becomes hyperexponential. Derivatives 
of all orders go to infinity. 

The predicted hazards from the linear model with the same form for v 
have exponential increase at the same rate as the numerator in Equation (1271) . 
If no upper age limit is imposed on the reproductive span, the Gompertz 
pattern continues out to infinity. If an upper age limit is imposed, the linear 
model has a Wall of Death at that age, but not before. 

Mutation rates that drop sufficiently rapidly with ages of onset will pre- 
vent Walls of Death in the absence of an upper limit to ages of reproduction. 
Indeed, Equation ffl5l) allows us to start with a fairly arbitrary target shape 
for h(a) and find a set of mutation rates u(a) which will generate it. The 
case 6 = 1 from our family of cases has an exponentially declining mutation 
rate which produces an increment to the hazard rate that is constant over 
age. 



7 Haldane's Principle 



Our equilibrium condition ( fl5|) encapsulates a notable result. In our 
non-linear model with point-mass increments, if an equilibrium exists, the 
aggregate population hazard maintained by mutation-selection balance at 
equilibrium does not depend on the sizes 77(a) of the mutational effects. 

This result generalizes a property of linear approximate models, but in 
a surprising direction. Scaling up the size of the effect of a mutation in- 
creases the selective pressure against it and reduces its expected frequency. 
In the linear setting, the expected hazard, that is, the average hazard aver- 
aged across the population, is insensitive to 77(a). Doubling 77(a) halves the 
expected frequency p(a) and leaves the expected hazard 7?(a)p(a) unchanged. 
In our full non-linear setting, it is not the expected hazard but the aggregate 
population hazard that comes out to be invariant to changes in 77(a). 

The difference between aggregate and expected hazards is a preoccupation 
of social scientists. It is due to demographic selection or culling, already 
discussed in Section [2j The population is heterogeneous. Some members 
have genotypes with more mutant alleles and lower net reproduction. Others 
have fewer mutant alleles and higher net reproduction. The heterogeneity 
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is what allows natural selection to operate over time, and it is also what 
produces demographic selection with increasing age. Members with more 
mutant alleles die at younger ages, and survivors to later ages carry smaller 
than average sets of mutant alleles. The aggregate population hazard, based 
on proportions surviving, is lower at advanced ages than the expected hazard, 
which averages over all members, irrespective of survival. 

In the linear framework, one way of calculating overall selective costs 
makes them equal the total mutation rate. The contribution from a number 
u(a)/(rjw(a)) of mutant alleles acting at a, each with a cost r)w(a), amounts 
to u(a), and the integral over ages of onset equals the total mutation rate. 
However, calculating selective cost directly from the predicted hazard gives 
a different answer, reflecting the inconsistencies built into the linear approx- 
imations. 

In the full non-linear model, the change in expected frequency p(a) which 
compensates for a change in effect size 77(a) is only approximately linear in 
1 /77(a). But the non-linearity of the change exactly balances the non-linear 
effect of culling. Cases in which most population members carry only a 
handful of mutant alleles, each with a large effect, lead to the same aggregate 
hazard as cases in which almost all population members carry a huge number 
of mutant alleles, each with a tiny effect. The variance in net reproduction 
across the heterogeneous population is very different in the two settings, but 
the aggregate hazard, the most readily observable outcome, is the same. 

The total mutation rate, obtained by integrating over m, turns out to 
equal a function of the aggregate population survivorship K p [l x (G)}. The 
expectation is taken over the Poisson point-process distribution for G deter- 
mined by the equilibrium intensity p. 

/■oo 

u(M) = / (- log (E p [l x (G)/l x (0)})) f x E p [l x (G)} dx (29) 

The right-hand side is a fertility-weighted version of lifetable e ntropy , 



described with references in the textbook by iKeyfitz and Caswelll ( 120051 ). 
pages 80-82 and 166. In a sense, this equation is a generalization of Haldane's 
Principle to our age-dependent models for mutation-selection balance under 
Free Recombination with point-mass profiles. The quantity equal to the total 
mutation rate, however, is not the overall loss of fitness for the population. 
The overall loss in fitness depends only on the aggregate population hazard 
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rate, so it remains independent of the sizes of mutational effects as Haldane 
posited, but not entirely independent of the age pattern of mutation rates. 

8 Conjectures and Conclusions 



The derivations in this paper show that non-linear interactions can make 
a profound difference to patterns of senescent mortality produced from muta- 
tion accumulation. Breakdowns can occur not only when rates of deleterious 
mutations are high but, in the simplest cases, whenever they are not zero. 

The setting with point-mass mutations is the one most thoroughly studied 
in the past, going back to Hamilton himself. It is a stylized setting. Do our 
main conclusions hold when the age-specific effects of deleterious mutations 
are not concentrated at single ages but spread across a range? 

We conjecture that unraveling does occur not only with point-mass pro- 
files but with profil es from translation families of the kind also treated by 
Charlesworthl (1200 ll ). Let the functions n(m,x) each vanish below some age 



of onset a(m) and have the same shape above it, being shifted versions of 
some template. We expect that a mutation rate v constant over ages of onset 
still leads to unraveling. We also expect that the shapes can vary to some 
extent and that the strict absence of effects below the ages of onset is the 
feature that drives models to unravel. With mutation rates that taper with 
age, such absence leads to early Walls of Death. 

A reasonable way for nature to avoid such pathologies would be for delete- 
rious mutations to have small effects of at least some minimal level at young 
ages even when their main effects are concentrated later. Such a requirement 
would keep genetic loads from growing to infinity. In the process, it would 
introduce a tendency for hazard rates to tend toward plateaus at high ages, 



Charlesworthl (120011 ) suggested this condition within the linear framework 
as a way of generating plateaus. With the non-linear model, the argument 
becomes stronger. The necessity to avoid unraveling becomes a reason for 
expecting plateaus. 

We have seen that Gompertz-Makeham hazards arise quite easily at young 
and medium ages from our non-linear model with point-mass profiles and 
tapering mutation rates. We conjecture that they also arise with distributed 
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mutational profiles of a more realistic kind. Applications of our model with 
richer families of profiles will be presented in a sequel. 

A number of aspects in the application of our non-linear model await 
examination. These include 

• possible closed-form solutions with point-mass profiles augmented with 
fixed early-age selective costs; 

• inclusion of mutations depressing fertility as well as augmenting haz- 
ards, with point-mass profiles and with more realistic profiles; 



extension of the results of iBaudischl (120081 ) for effects that act multi 



plicatively on hazards, exploiting the full non-linear model; 

• comparisons of predicted hazards between our model with Free Recom- 
bination and the alternative model without recombination; 

• study of intermediate assumptions about recombination and their im- 
plications for unraveling and Walls of Death; 

A larger goal is to begin to integrate the non-linear models for mutation 
accumulation examined here with models for other contributors to senescent 
processes. Mutation accumulation does not act in isolation. It reshapes 
vital schedules that themselves reflect cellular and organismic processes and 
considerations of life-history optimization in interactions with environments. 
Mathematical modeling of mutation accumulation is a point of departure for 
further enhancements of our evolutionary understanding of senescence. 
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